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Abstract 

We use recent results on algorithms for Markov decision problems to show that 
a canonical form for a generalized P-matrix can be computed, in some important 
cases, by a strongly polynomial algorithm. 

1 Introduction 

Suppose that the m x n real matrix A has the block form 

A=[A 1 | A 2 | ■ • • | A m ], 

with each column of block Aj being of the form ej — jPjk, with ej the j th standard basis 
vector and pjk a vector of nonnegative entries that sum to one. A Markov decision problem, 
as described in [11] , is a linear program of the form: maximize v T b, subject to v T A < c T 
where b G lR m is positive and c G ~R n . Two recent papers, pj] and subsequently [6], 
advanced the theory of such problems significantly. Ye proved that the simplex method, 
with Dantzig's rule for the entering variable, is a strongly polynomial algorithm to solve 
the Markov decision problem if the discount factor 7 is fixed. Such a positive result for 
complexity of a pivoting algorithm is rare. The paper of Hansen, Miltersen and Zwick 
showed a similar result for a considerable generalization of the Markov decision problem, 
in which one seeks a solution to the optimality conditions of the problem, with some 
inequalities reversed. 

It is natural to seek applications of the advances achieved in these recent papers. The 
sequence of papers by Kitihara and Mizuno, of which [7] is one example, generalizes Ye's 
results to other linear programs. 

The matrix A from a Markov decision problem has the P-property, which means that all 
square submatrices of A formed by taking one column from each block have determinants 
of the same non-zero sign. We consider the problem of satisfying the optimality conditions 
of the Markov decision problem for more general A with the P-property. This problem 
is known as the generalized LCP (see PQ, [5]). We introduce this problem in Section 
2. Theorem 1 of that section describes a canonical form for A with the P-property, 
introduced in [S] to describe the set of "n-step vectors" of an LCP. One of our goals is to 
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emphasize the significance of this form. Theorem 2 describes a subclass of matrices with 
the P-property for which the analyses of [UJ can be directly applied. This description is 
in terms of the canonical form of Theorem 1. Theorems 1 and 2 are not essentially new, 
but the proofs supplied are more streamlined than earlier proofs. 

Section 3 discusses the Markov decision problem. The complexity results of [llj and 
[6] have the term 1 — 7 in the denominator. This leads us to look for equivalent Markov 
decision problems for which this term is as large as possible. A linear program to find 
such an optimal problem is formulated. We call this linear program LP(A). 

Section 4 presents a two step algorithm to solve LP(A). This method finds the canon- 
ical form of Theorem 1 along the way. Section 5 shows that the two step algorithm to 
solve LP(A) is a strongly polynomial time algorithm, if the optimal value of LP(A) is a 
fixed positive number. 

2 The P-property 

Suppose that the matrix A G j| mxn has its column set indexed by the set of pairs 

{(j,k) :j = 1,2, ...,m,k= 1,2,..., rij}, 

where n\ + + • • • + n m = n. A representative submatrix of A is an m x m submatrix 
for which, for each j = 1, 2, . . . , m the j th column is indexed by (j, k) for some k. 

The matrix A has the P-property if the determinants of all of its representative sub- 
matrices have the same nonzero sign. (See Example 1 on Page 5 of this article.) Cottle 
and Dantzig [li], Theorem 3, proved the existence part of the following. (Uniqueness was 
proved by Szanc [10].) 

Proposition 1 Suppose that A has the P-property. If c G MJ 1 then there exists a unique 
vector v G M. m so that c T — v T A > 0, and for each j = 1,2, ... ,m, (c T — v T A)jk = for 
at least one k G {1, 2, ... , rij}. 

Let C be the representative matrix of A for which column j is column (j,n,j) of A, for 
j = 1,2, ... ,m. Form the matrix A from the columns (j, k) of C~ x A for which k < rij. 
Let Cat be the subvector of c with subscripts (j,k) only for k < rij. Then the vector 
z = C T v satisfies z > and w = c^ — z T A > 0, Zj Ylk^i 1 w jk = for j = 1, 2, . . . , m. 
Thus z is a solution to the generalized linear complementarity problem defined by A T and 
c. If nj = 2 for all j, then this generalized linear complementarity problem is a standard 
linear complementarity problem. 

A reformulation of the P-property that follows easily from [lj, Theorem 5, is that 
for every nonzero vector x in the row space of A, there is a j G {1,2, .. . ,m} so that 
the coordinates Xjk have the same nonzero sign for k = 1,2, . . . ,n,j. We call this the 
sign-preserving property of the vector x. Finally, Theorem 6 of pQ says that if A has the 
P-property, then the row space of A contains a positive vector. 

A polytope Vaj defined by the system Ax = b, x > is said to be combinatorially 
equivalent to a product of simplices if the solution to Cx = b is positive for every rep- 
resentative submatrix C of A. In that case, we say that the representative submatrices 
of A are the nondegenerate basic feasible solutions to Ax = b, x > 0. If nj = 2 for 
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j = 1, 2, . . . , m, then such a polytope VA,b is combinatorially equivalent to a cube. If VA,b 
is combinatorially equivalent to a product of simplices, then A has the P-property. 

A square matrix with nonnegative off-diagonal entries is called a Z-matrix. The fol- 
lowing classical theorem j3] is central for Z-matrices. 

Proposition 2 Let M be a Z-matrix. The following are equivalent. 

1. M is a P-matrix, i.e. all principal minors of M are positive. 

2. The system x > 0, Mx > has a solution. 

3. The inverse of M exists and is nonnegative. 

A Z-matrix M satisfying the conditions of Proposition 2 is called a K-matrix. 

Theorem 1 Suppose A has the P-property. Then there exists an m x m matrix X so 
that 

1. (XA)ijk > whenever i = j . 

2. (XA)ijk <= whenever i ^ j . 

3. For every pair (i,j) with i ^ j , there exists k G {1, 2, . . . , rij} so that (XA)ijk = 0. 
X is unique up to positive scaling of its rows. 

Proof. We recall the matrix A = C~ 1 A, where C is the representative matrix of 
A for which column j is column {j,nj) of A, for j = 1,2, ... ,m. Let A m be the ma- 
trix obtained from A by deleting row m and deleting columns (m, 1), (m, 2), . . . , (m, n m ). 
Also, define c to be —1 times the vector obtained from row m of A by deleting en- 
tries (m, 1), (m, 2), . . . , (m,n m ). Then A m has the P-property, so there is a vector v G 
M m_1 satisfying c T — v T A m > 0, and for every j ^ n there is a k G {1, 2, . . . , rij} 
so that (c T — v T A m )j k = 0. Appending a 1 to the end of vector v, we get (v, 1) T A 
which has the negative of the sign pattern of c T — v T A m on coordinates not indexed by 
(m, 1), (m, 2), . . . , (m, n m ), and which is 1 on component (m,n m ). The sign-preserving 
property of the row space of A then implies that (v, 1) T A is positive on all components 
indexed by (m, 1), (m, 2), . . . , (m, n m ). Thus (v, l) T C~ l can serve as row m of the matrix 
X. A similar construction can be carried out for each of the rows of A. I 
Theorem 1 was proved in [8j for the case \rij\ = 2 for all j. The algorithm in the 
Proof to construct X uses Proposition 1 applied to matrices with m — 1 rows. This 
improves slightly upon the original proof which applied Proposition 1 to matrices with m 
rows. Unfortunately, the complexity of finding the vector guaranteed by Proposition 1 is 
unknown for general A with the P-property. We will call the matrix A = XA given by 
the previous theorem the complementary Z-form of A. 

Theorem 2 Let A have the P-property. The following are equivalent: 
1. There exists a positive vector p G M m so that p 7 A > 0. 
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2. There exists a matrix X G ]j mxm an( ^ a positive vector p G M m so that XA satisfies 
conditions (1) and (2) of Theorem 1, andp T XA > 0. 

3. There exists a vector b G M. m so that for every representative submatrix C of A the 
solution to Cx = b is positive. 

Proof. Clearly, (1) implies (2). If (2) holds, then every representative submatrix of 
XA is a K-matrix. Let q be any positive vector. Then for any representative submatrix 
C of XA, the solution to Cx = q is positive. It follows that for every representative 
submatrix D = X C of A, the solution to Dx = b is positive for b = X q. Thus 
(3) holds. To show that (3) implies (1), suppose that the nondegenerate feasible bases 
of Ax = b, x > are the representative submatrices of A. Let C be a representative 
submatrix of XA for which C m j = 0, j = 1, 2, . . . , m — 1. The solution to Cx = Xb must 
be positive, and C mm is positive, so component m of Xb is positive. In a similar way, we 
see that all components of Xb are positive. It follows that every representative matrix 
of A is a K-matrix. We add a column (j, rij + 1), equal to the j th standard basis vector 
of IR' m , to A for each j = 1,2, ... ,m. The resulting matrix retains the P-property, by 
property (1) of Proposition 2 and so it has a positive vector in its row space. There is 
therefore a positive vector p so that p T A > 0. I 

The equivalence of (2) and (3) in Theorem 2 was already discussed in [5]. The equiv- 
alence of (1) to the other conditions was discussed for the case \rij\ = 2 for all j in [8] and 
[2], section 4.8. The proof above seems to be particularly straightforward. The matrix X 
is useful for several reasons. It provides a quick certificate for proving that the equivalent 
conditions of Theorem 2 do not hold. One needs only to display A and a representative 
submatrix C of A for which the system Cx < 0, x > 0, x ^ has a solution. A different 
reason arises if one wants to describe the set of vectors b for which the representative 
submatrices of A are the nondegenerate feasible bases of Ax = b,x > 0. This set is 
{X q : q > 0}, a simplicial cone. In the case where A = (I, M) where column j of / is 
indexed by (j, 1) and column j of a square matrix M is indexed by (j, 2) for all j, a vector 
b for which the ([2] section 4.8) representative submatrices of A are the nondegenerate 
feasible bases of Ax = b, x > is a so-called n-step vector for the matrix M and can be 
used to solve any linear complementarity problem with matrix M in at most m pivots. 
The location of the zeroes in A may offer useful combinatorial information. If \nj\ = 2 
for all j, Theorem 2 holds, and one of the representative submatrices of A is a diagonal 
matrix, then the paper [3] shows that the simplex method finds the vector guaranteed by 
Proposition 1 very quickly. 

It follows that the efficient computation of X is of interest. 

3 Discounted Markov Decision Problems 

A discounted Markov decision problem is defined by an m x n matrix A, with the columns 
indexed by pairs (j, k) as in the previous section, a vector c6l", and a scalar 7 between 
and 1. Every representative submatrix of A is of the form / — jP, with / the mxm 
identity matrix and P a nonnegative matrix with column sums equal to 1. The goal is to 
find a vector v G M m so that c T — v T A > 0, and for each j = 1, 2, . . . , m, (c T — v T A)jk = 
for at least one k G {1, 2, ... , rij}. 
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It is easy to see that the matrix A for a discounted Markov decision problem satisfies 
the P-property. Moreover, it satisfies condition (2) of Theorem 2 with X = I and p = e m , 
the vector of length m with all entries equal to one. 

Proposition 3 Suppose that the matrix A has the P-property and satisfies the equivalent 
conditions of Theorem 2. Then the problem of finding a v as in Proposition 1 is equivalent 
to solving a Markov decision problem. 

Proof. Suppose that A satisfies the equivalent conditions of Theorem 2. Then there 
exists X G ]R mx,n satisfying (2) of Theorem 2, and by scaling the rows of X we can 
assure that (e m ) T XA > 0. It will not generally be true that the entries of (e m ) T XA 
will all be equal, but we can enforce this by adding a row and column in a way that 
will not affect the application of any pivoting algorithm applied to the matrix XA^Let 
d = minu^jA Yli{XA)ij k . We can augment XA to an (m + 1) x (n + 1) matrix XA by 
first adding an (n + l) s * column, indexed by (m + 1, 1), of zeroes, and then adding a 
row of length n + 1 for which entry (j, k) is d — 'Y^™ = i{XA) i j k for j = 1, 2, . . . , m + 1, 

Jv — — 1,2,..., Tl j . 

Every representative submatrix of the matrix XA will contain the new column (m + 
1, 1). Suppose that c is a vector in R n , and let c be c with a appended as an (m + 1, 1) 
component. Suppose that C is a representative submatrix of_A with column indices in a 
set J C [n] and let C be the representative submatrix of XA with columns indexed by 
JU{(m+l,l)}. Let cj be the subvector of c with components indexed by J, and let c~j 
be the subvector of c with components indexed by JU{(m+l, 1)}. It is easy to see that a 
vector p G M m satisfies v T C = c~j if and only if v^ +1 = and the vector v' obtained from 
v by deleting the last component, satisfies {v') T XC = cj. Thus, any pivoting algorithm 
applied to solving the linear program: minimize c T x subject to Ax = b, x > 0, where the 
representative matrices of A are the nondegenerate feasible bases of Ax = b, x > 0, will 
go through the same steps as one applied to solving the Markov decision problem given 
by XA and c. I 

One example of a linear program that Proposition 2 shows is equivalent to a Markov 
decision problem is the Klee-Minty cube, studied in [7]. The transformation of the Klee- 
Minty cube into a Markov decision problem necessarily leads to a very small value of 
1 — 7. Two recent papers give polynomial time complexity results on pivoting algorithms 
for the discounted Markov decision problem. The paper [TTJ proves that Dantzig's rule 
of choosing the entering variable with the smallest reduced cost, applied to a discounted 
Markov decision problem with an m x n matrix and discount factor 7, takes at most 
m (™~™^ . i g (j^-) pivots to find the solution. The paper j6] proves that the strategy 
iteration method requires at most 0(j^log(j^) iterations. These bounds are comparable 
because an iteration of the strategy iteration method requires more work than an iteration 
of the simplex method. Both of these bounds are polynomials in n and m if 7 is not 
assumed to be part of the input data. One should note that the paper [S] solves a 
considerable generalization of the Markov decision problem with the same complexity. 
This generalization allows for a partition of the set [m] into two parts Mi and M%, and 
the vector v must satisfy (c T — v T A)j k > for j G Mi and (c T — v T A)j k < for j G 
M 2 , in addition to the complementarity condition which says that for each j G [m], 
(c T — v T A)j k = for some k G {1, 2, ... , nj}. 
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Given a matrix A with the P-property, it seems desirable to look for an equivalent 
Markov decision problem matrix with 1 — 7 as large as possible. This leads to the following 
linear program: 

maximize d subject to 
(XA)ij k < 1 whenever i — j 
(XA)ij k < whenever i ^ j 



(e m ) T XA > d(e n ) T 
This linear program is similar to one from [9]. 

We will call this linear program LP (A). It is clear that LP (A) always has the feasible 
solution X = 0, d = 0, and that LP (A) has a solution with positive optimal value if and 
only if one of the equivalent conditions of Theorem 2 is satisfied. 



Example 1 Let 



A 



4 


4 


-1 


-3 


-2 


-1 " 


-2 


-1 


4 


4 


-1 


-1 


-1 


-2 


-1 





4 


4 



Using Maple 11 to solve LP (A) yields the solution 



X 



111/350 37/350 37/350 
1/25 7/25 2/25 
4/35 3/35 2/7 



d = 33/70, 



and 



XA 



333/350 333/350 | -37/70 
-12/25 -7/25 I 1 1 
-1/5 I -2/35 



-111/350 
-1/25 
29/35 33/35 



In this example, the columns appear in the order (1, 1), (1, 2), (2, 1), (2, 2), (3, 1), (3, 2), 
and ni = n2 = n% = 2. Note that the matrix A already satisfies the sign conditions 
(1) and (2) of Theorem 1, and it satisfies condition (2) of Theorem 2. The matrix XA 
found by Maple 11 does not, however, satisfy condition (3) of Theorem 1, because neither 
(XA) 2 n nor (XA) 2 i2 is 0. This leads one to ask if there must be an optimum solution 
of LP(A), if the objective value of LP(A) is positive, for which all three conditions of 
Theorem 1 are satisfied. 



4 The two step method 

We will assume in this section that A has the P-property and that A satisfies the equivalent 
conditions of Theorem 2. We want to show that LP (A) can be computed in strongly 
polynomial time if the optimal value of LP (A) is a constant and not part of the input. In 
addition, an optimal X satisfying all three conditions of Theorem 1 will be found. First 
we recall the matrix A m from the proof of Theorem 1. 

Lemma 1 The matrix A m satisfies the equivalent conditions of Theorem 2. 
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Proof. Suppose X is as in Theorem 1. Element (XA) mmnm is positive. Let E be the ma- 
trix obtained from themxm identity matrix by replacing entry (j, m) by —(XA)j mrim / (XA) r 
for j = 1,2, ... , to — 1. It is easy to see that if p T XA > 0, then p T E~ l will be positive 
and satisfy (p T E~ 1 )(EXA) > 0. Form A from EX A by deleting row to and columns 
(to, 1), (to, 2), . . . , (m, n m ). Matrix A satisfies the equivalent conditions of Theorem 2. 
Furthermore, there exists an (to — 1) x (to — 1) matrix X m so that Tf™ = X"M m , where 
A m is from the proof of Theorem 1 . I 

Proposition 4 Row m of X can be found by solving a Markov decision problem with 
matrix A m . 

Proof. As in the proof of Theorem 1, define c to be —1 times the vector obtained 
from row to of A by deleting entries (to, 1), (m, 2), . . . , (to, n m ). Because A m satisfies 
the equivalent conditions of Theorem 2, there exists a vector b e R m_1 so that the 
representative submatrices of A m are the nondegenerate feasible bases of A m x = b, x > 0. 
The vector v that was found in the proof of Theorem 1 can be found as the dual solution 
to the linear program minimize c T x subject to A m x = b, x > 0. Note that one does 
not need to know b in order to solve this linear program by a pivoting method. If the 
entering variable is indexed by the pair (j,k), then the leaving variable is the currently 
basic variable indexed by (j, k') for some k' ^ k. For (v, 1) obtained by appending a 
1 to v we obtain (v, l)A, a vector in the row space of A that is nonpositive on entries 
not indexed by (to, 1), (to, 2), . . . , (to, n m ) and has for each j = 1,2, . . . , to — 1 an entry 
(j, k) that is zero. It is also positive on entry (m,n m ), so by the sign-preserving property 
of the row space of A it is positive on all components (to, 1), (to, 2), . . . , (to, n m ). Thus 
(v, 1) T (7 _1 can serve as row to of matrix X. I 
We can obtain the other rows of X similarly, by solving linear programs. 

Example 2 Applied to the previous example, we obtain 



X 



1/3 1/9 
3/19 7/19 
4/33 1/11 



1/9 
5/38 
10/33 



and 



1 1 | -5/9 

T 4 : -9/38 | 45/38 1 
-7/33 | -2/33 

Note that the entry in position (j,j,rij) is 1 for all j . 



-1/3 
-3/19 
29/33 1 



The second step of our method is to determine an optimum scaling of the rows of 
the matrix X. Let A = XA. The column sums of A are not necessarily between and 
1. We find an optimum scaling vector x G M m by solving the linear program: maximize 
d, subject to x T A > de n , XjAjj k < 1 for all j = 1,2, ... , to, k e [rij]. If diag(x) is the 
to x to diagonal matrix with x on the diagonal, we see that x T A = (e m ) T diag(x)A, so 
the vector x can be thought of as an optimal scaling of the rows of A to bring it into the 
MDP format. We call this linear program the scaling LP. The matrix diag(x) is feasible 
for LP (A), and it has the extra feature that condition (3) of Theorem 1 is satisfied. It is 
not immediately obvious, though, that the optimal value of the scaling LP is equal to the 
optimal value of LP(A). To prove the equality is our remaining goal. 
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Example 3 Applying the scaling LP to A from our previous example yields 



diag(x) 



47/70 
38/45 
33/35 



, d = 33/70 



and 



diag(x)XA 



47/70 47/70 | -47/126 
-1/5 ; 1 38/45 
-1/5 ; -2/35 



-47/210 
-2/15 
29/35 33/35 



Proposition 5 The linear program LP (A) has the same optimal value as the scaling LP. 

Proof. It is clear that the linear programs LP (A) and LP (A) have the same optimal 
value, so we want to show that the optimal values of LP (A) and the scaling LP are the 
same. Note that the optimal values of both LPs are positive. For the scaling LP, this 
implies, by (3) of Proposition 1, that the solution x will be nonnegative. We will compare 
the dual linear programs. The dual to LP (A) has variables y^ corresponding to entries 
of A and variables Wjk corresponding to columns of A. 



minimize E« 



-je[m],ke 



n ,y ijk subject to 



Ej 6H ,fceK] Ai'jWjk = £jeM,fcefo] A i'jkWjk for all i G [m],i' G [m\ ^ 

Sje[m],ke[raj] W jk = 1> 

y > 0,w > 

The scaling LP has variables yjjk corresponding to entries of A for which the first two 
indices are equal, and variables Wjk corresponding to columns of A. 

minimize £ i=je[m])fc6[nj .]j/yk subject to 
Efceini] AukVuk = E ie H,fceK] A ijk™jk for all i G [m] 



E 



je[m],ke[rij] 



y>0,w>0 

i = j} and {w* k } are an optimal solution to the dual of the scaling 



Suppose that {y* jk 

LP. We will create a solution (y,w) to the dual of LP (A) by solving systems of equations. 
We will keep the solution to the scaling LP: y ijk = y* jk whenever i — j and w 



w 



For each i = 1,2, ... ,m, we will form a linear system to solve for variables y^ k for 
j 7^ i. Given i, let Ki be a function that assigns to each j ^ i a pair (j, k) for which 
Aijk — 0- System % will have an equation for each i' ^ i: 

^2 A iijK l (j)yijK l (j)= Yl A i'jkW* k 



iky iiki 



je[m],fee[rtj 



fce[n»] 



It is easy to verify that the right hand side of this system is nonnegative. The coefficient 
matrix of the system is an (m— 1) x (m— 1) i^-matrix, so the system will have a nonnegative 
solution when the right hand side is nonnegative. The systems for different i have disjoint 
sets of variables, so they are solved independently. All variables y^ k not specified by the 
scaling LP or by the systems are set to 0. The resulting (y, w) is a feasible solution to the 
dual LP of LP(A). The objective value of this dual solution is the same as that of the 
scaling LP, because the variables in the two objective functions are the same. I 
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5 Complexity 



Now we assume that (X, d) solves LP (A) and that d > 0. We will assume that the matrix 
X satisfies conditions (1) — (3) of Theorem 1. We have (e m ) T XA > d(e n ) T . 

Proposition 6 Row m of X is found by a strongly polynomial algorithm, solving a 
Markov decision problem with matrix A m , if d is considered to be a constant. 

Proof. Let E be the matrix obtained from the mxm identity matrix by replacing entry 
(j, m) by -(XA) jmnm / [XA) mmnm for j = 1, 2, . . . , m - 1. Form A™ from EX A by delet- 
ing row m and columns (m, 1), (m, 2), . . . , (m, n m ). Then for any (j, k) with j £ [m— 1] and 
k £ [ nj ], we have d < (e m ) T A. jk = (e™" 1 )^™ +(l-£^(^^^ 

This last term is nonpositive, so (e m-1 ) T A jfc > d. It follows from [11] that the linear pro- 
gram to compute row m of X (without scaling) can be solved in at most m ("~ m ) . l g(jBL'} 
pivots using Dantzig's rule, or with similar complexity using the results of [B]. I 

Proposition 7 The scaling LP can be solved by applying the strongly polynomial algo- 
rithm of [6], again assuming d is a constant, followed by solving a one-variable linear 
program. 

Proof. Here we first find a vector v satisfying v T A > (e n ) T , such that for every j £ [m] 
there exists afcG [rij] such that component (j, k) of (e n ) T — v T A is zero. This vector can 
be found by the strongly polynomial algorithm of [6j. Then we find the largest d so that 
dviAuk < 1 for all i £ [m], k £ [m). Let (i, i, k) index an entry of d(diag{v))A that is equal 
to 1. Let C be a representative submatrix of d(diag(v))A satisfying (e m ) T C = d(e m ) T . 
We build a dual solution (y, w) to the scaling LP as follows. We let yak = d, and let 
the components of w corresponding to the columns of C satisfy Cw = de{. Let all other 
components of y and w be 0. Then the system Cw = dei guarantees that the constraints 

^2 AakViik = ^2 A ijkW jk for all % £ [m] 

k£[rii] j£[m],k£[nj] 

will be satisfied. Premultiplying both sides of Cw = dei by (e' m ) T gives (e m ) T Cw = 
d(e m ) T ei, or (e m ) T w = 1. Thus (y,w) is a feasible solution to the dual of the scaling LP, 
which has the same value as the solution dv of the primal LP. I 

Theorem 3 If the optimal value of the linear program LP (A) is positive, and this optimal 
value is considered a constant that is not part of the input data, then there is a strongly 
polynomial algorithm to solve LP {A). 

It is interesting that the two step method described here serves two seemingly indepen- 
dent purposes, to satisfy condition (3) of Theorem 1 and to be efficient. There does not 
appear to be reason to assume that the straightforward application of, say, the simplex 
method to LP (A) would be strongly polynomial. The solution diag(x)X found to LP(A) 
also has the property that for each j £ [m] there is an index (j, k) for which x T XAjk = d, 
where d is the optimal value of the LP. 
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